Multi-variety adaptive acoustic modeling in HSMM-based speech synthesis

نویسندگان

Markus Toman

Michael Pucher

Dietmar Schabus

چکیده

In this paper we apply adaptive modeling methods in Hidden Semi-Markov Model (HSMM) based speech synthesis to the modeling of three different varieties, namely standard Austrian German, one Middle Bavarian (Upper Austria, Bad Goisern), and one South Bavarian (East Tyrol, Innervillgraten) dialect. We investigate different adaptation methods like dialectadaptive training and dialect clustering that can exploit the common phone sets of dialects and standard, as well as speakerdependent modeling. We show that most adaptive and speakerdependent methods achieve a good score on overall (speaker and variety) similarity. Concerning overall quality there is no significant difference between adaptive methods and speakerdependent methods in general for the present data set.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A style control technique for singing voice synthesis based on multiple-regression HSMM

This paper proposes a technique for controlling singing style in the HMM-based singing voice synthesis. A style control technique based on multiple regression HSMM (MRHSMM), which was originally proposed for the HMM-based expressive speech synthesis, is applied to the conventional technique. The idea of pitch adaptive training is introduced into the MRHSMM to improve the modeling accuracy of fu...

متن کامل

Visual control of hidden-semi-Markov-model based acoustic speech synthesis

We show how to visually control acoustic speech synthesis by modelling the dependency between visual and acoustic parameters within the Hidden-Semi-Markov-Model (HSMM) based speech synthesis framework. A joint audio-visual model is trained with 3D facial marker trajectories as visual features. Since the dependencies of acoustic features on visual features are only present for certain phones, we...

متن کامل

Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005

In January 2005, an open evaluation of corpus-based textto-speech synthesis systems using common speech datasets, named Blizzard Challenge 2005, was conducted. Nitech group participated to this challenge with a newly designed HMM-based speech synthesis system (Nitech-HTS 2005). In the present paper, technical details, building processes, and the performance of the Nitech-HTS 2005 voices are des...

متن کامل

An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005

In the present paper, hidden Markov model (HMM) based speech synthesis system developed in Nagoya Institute of Technology (Nitech-HTS) for a competition of text-to-speech synthesis systems using the same speech databases, named Blizzard Challenge 2005, is described. We show an overview of the basic HMM-based speech synthesis system and then recent developments to the latest one such as STRAIGHT...

متن کامل

An Overview of Nitech HMM-based for Blizzard Challen

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Multi-variety adaptive acoustic modeling in HSMM-based speech synthesis

نویسندگان

چکیده

منابع مشابه

A style control technique for singing voice synthesis based on multiple-regression HSMM

Visual control of hidden-semi-Markov-model based acoustic speech synthesis

Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005

An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005

An Overview of Nitech HMM-based for Blizzard Challen

عنوان ژورنال:

اشتراک گذاری